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4 <110> APPLICANT: Dunn -Coleman, Nigel 

5 Goedegebuur, Frits 

6 Ward, Michael 

7 Yao, Jian 

9 <120> TITLE OF INVENTION: BGL4 Beta-Glucosidase and Nucleic Acids 
10 Encoding the Same 

12 <130> FILE REFERENCE: GC696 

14 <140> CURRENT APPLICATION NUMBER: US 10/027,000 

15 <141> CURRENT FILING DATE: ^04-12>3r^ 
17 <160> NUMBER OF SEQ ID NOS k 3 

19 <170> SOFTWARE: FastSEQ forSfindows Version 4.0 

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 2976 

23 <212> TYPE: DNA 

24 <213> ORGANISM: Trichoderma reesei 

26 <400> SEQUENCE: 1 

27 ttatagtcgc tttgttaaat tggcctcgag gtcgacccac gcgtccggct gcttgtcccg 60 

28 ttcctgtgcc tgatgtctat ctgccgttgg cctcctcatc ctcatctccc tgttgtctgt 120 

29 ctcctcttag tgcttcagtg acgctaggtt cggtcacttt gtcccccctt cgttgctctg 180 

30 gtgtgtccaa ggtctaccct gcagtggttt tgaacccttg atatcctgct tgagcatccg 240 

31 cgtcgccata tagagcagca tattcttcta tctccaaaga tcccctcacc gagagttcta 300 

32 ttcacccgac ccttgccttg tcatccagtc cttccatcat ggctgatatt gatgttgagg 360 

33 ccatcttgaa gaagctcacc ctggccgaga aggtcgatct gctggctggt atcgacttct 420 

34 ggcacacaaa ggctctcccc aagcatggag tcccctctct ccgctttaca gatggcccca 480 

35 acggcgtaag agggaccaag ttcttcaatg gcgtccctgc ggcctgcttc ccttgcggca 540 

36 cgtcgctcgg ttccacattc aaccaaactc tgctcgaaga ggcaggtaag atgatgggca 600 

37 aagaggccat cgctaagagt gcgcatgtga tcctcggccc gactatcaac atgcaacgct 660 

38 cccctctcgg tggacgtggc ttcgagtcga ttggtgagga tccgttcctg gcgggcttgg 720 

39 gagctgcggc tctcatccgc ggcattcaga gcactggagt gcaggctacg atcaagcact 780 

40 ttttgtgcaa tgatcaggag gacaggcgca tgatggtgca gagcatcgtc acggagcggg 840 

41 ctctccgtga aatctacgca ctcccgttcc agattgctgt gcgagactcc cagccgggtg 900 

42 cgttcatgac ggcgtacaat ggcatcaatg gcgtgtcgtg cagcgagaac cctaaatatc 960 

43 ttgatgggat gcttcgaaag gaatggggtt gggatggcct aatcatgagc gactggtacg 1020 

44 gcacatacag taccacagaa gccgttgtgg caggcctcga cctcgagatg cccggacctc 1080 

45 cacgcttccg aggagaaaca ctcaagttca acgtctccaa cggaaagccc tttatccacg 1140 

46 tcattgacca gagggctagg gaagttcttc agttcgtcaa gaagtgtgct gcctccggag 1200 

47 tgacggagaa cggccccgag acgactgtca acaacacccc cgaaacggca gctctcctcc 1260 

48 ggaaggttgg caacgagggc atcgtgctgc tgaagaacga gaacaacgtt ctgcccttga 1320 

49 gcaagaagaa gaagacgctg attgtcggcc ccaacgccaa gcaggccaca taccacggcg 1380 

50 gaggctctgc cgcactcagg gcctactacg cagtcactcc ctttgacggc ctcagcaagc 1440 

51 agctcgagac gccgccatcg tacaccgtcg gcgcctacac caccgttcct cccattctag 1500 

52 gcgagcagtg cctcacgccc gacggcgctc cgggcatgcg ctggagggtc ttcaacgagc _ 1560 

53 cccctggtac ccctaaccgc cagcacattg acgagctctt cttcaccaag acggacatgc 1620 
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54 acctggtgga ctactaccac cccaaggcgg cagacacgtg gtacgccgac atggagggca 

55 cgtacaccgc cgacgaggac tgcacctacg agctcggcct cgtcgtctgc ggcacggcaa 

56 aggcgtacgt agacgaccag ctcgtcgtcg acaacgccac caagcaggtc cccggcgatg 

57 ccttcttcgg ctccgccacc cgcgaggaga cgggccgcat caatctcgtc aagggcaaca 

58 cgtacaagtt caagatcgag ttcggctccg cacccaccta caccctcaag ggcgacacca 

59 tcgtccccgg ccacggctcc ctccgcgtcg gcggctgcaa ggtcattgac gaccaggccg 

60 aaatcgaaaa gtccgtcgcc ctcgccaagg agcacgacca ggtcatcatc tgcgcgggcc 

61 ttaacgccga ctgggagacc gagggcgccg accgcgcgag catgaagctc cccggcgtgc 

62 tggaccagct cattgccgac gtggccgccg cgaacccaaa caccgtcgtc gtcatgcaga 

6 3 cgggcacccc cgaggagatg ccctggctcg acgccacgcc cgccgtcatc caggcctggt 

64 acggcggcaa cgagacgggc aactccattg ccgacgtcgt ctttggcgac tacaacccct 

65 cgggcaagct gtccctcagc ttccccaagc gcctgcagga caaccccgcg tttctcaact 

66 tccgcaccga ggccgggcgc acgctgtacg gcgaggacgt ctacgtcggg tacaggtact 

67 acgagtttgc cgacaaggac gtcaatttcc cctttggcca cggcctgtcc tacaccactt 

68 ttgccttttc caatctctcc gtgtctcaca aggacggcaa gctgagcgtg tccctctccg 

69 tgaagaacac cggctccgtg cccggcgcac aggtggccca gctctacgtc aagcccctcc 

70 aagcggccaa gattaaccgc cccgtcaagg agctcaaggg cttcgcaaag gtcgaactgc 

71 agcccggcga gacgaaggcg gtgacaatcg aggagcagga gaagtacgtc gctgcgtatt 

72 ttgatgagga gcgggatcag tggtgtgtcg aaaagggtga ctatgaggtt atcgtgagcg 

7 3 acagcagcgc agcgaaggat ggggttgcgc tcaggggtaa gtttacggtg ggagagacgt 

74 attggtggtc tggcgtgtaa agtcgtgcat catctttggc agattgaatc cagtcacttt 

75 gcatatagcg ccgatgaaat gagaaccaac gatcattgtg taaaaaaaaa aaaaaaaaaa 

76 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 

78 <210> SEQ ID NO: 2 

79 <211> LENGTH: 833 

80 <212> TYPE: PRT 

81 <213> ORGANISM: Trichoderma reesei 
83 <400> SEQUENCE: 2 
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100 Ala Ala Ala Leu He Arg Gly He Gin Ser Thr Gly Val Gin Ala Thr 

101 130 135 140 



102 He Lys His Phe Leu Cys Asn Asp Gin Glu Asp Arg Arg Met Met Val 

103 145 150 155 160 

104 Gin Ser He Val Thr Glu Arg Ala Leu Arg Glu He Tyr Ala Leu Pro 
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105 165 170 175 

106 Phe Gin He Ala Val Arg Asp Ser Gin Pro Gly Ala Phe Met Thr Ala 

107 180 185 190 

108 Tyr Asn Gly He Asn Gly Val Ser Cys Ser Glu Asn Pro Lys Tyr Leu 

109 195 200 205 

110 Asp Gly Met Leu Arg Lys Glu Trp Gly Trp Asp Gly Leu He Met Ser 

111 210 215 220 

112 Asp Trp Tyr Gly Thr Tyr Ser Thr Thr Glu Ala Val Val Ala Gly Leu 

113 225 ' * 230 235 240 

114 Asp Leu Glu Met Pro Gly Pro Pro Arg Phe Arg Gly Glu Thr Leu Lys 

115 245 250 ,255 

116 Phe Asn Val Ser Asn Gly Lys Pro Phe He His Val He Asp Gin Arg 

117 260 265 270 

118 Ala Arg Glu Val Leu Gin Phe Val Lys Lys Cys Ala Ala Ser Gly Val 

119 275 280 285 

120 Thr Glu Asn Gly Pro Glu Thr Thr Val Asn Asn Thr Pro Glu Thr Ala 

121 290 ~ 295 300 

122 Ala Leu Leu Arg Lys Val Gly Asn Glu Gly He Val Leu Leu Lys Asn 

123 305 310 315 320 

124 Glu Asn Asn Val Leu Pro Leu Ser Lys Lys Lys Lys Thr Leu He Val 

125 325 330 335 

126 Gly Pro Asn Ala Lys Gin Ala Thr Tyr His Gly Gly Gly Ser Ala Ala 

127 340 345 350 

128 Leu Arg Ala Tyr Tyr Ala Val Thr Pro Phe Asp Gly Leu Ser Lys Gin 

129 355 ^ 360 365 

130 Leu Glu Thr Pro Pro Ser Tyr Thr Val Gly Ala Tyr Thr Thr Val Pro 

131 370 375 380 

132 Pro He Leu Gly Glu Gin Cys Leu Thr Pro Asp Gly Ala Pro Gly Met 

133 385 390 395 400 

134 Arg Trp Arg Val Phe Asn Glu Pro Pro Gly Thr Pro Asn Arg Gin His 

135 " 405 410 415 

136 He Asp Glu Leu Phe Phe Thr Lys Thr Asp Met His Leu Val Asp Tyr 

137 420 425 430 

138 Tyr His Pro Lys Ala Ala Asp Thr Trp Tyr Ala Asp Met Glu Gly Thr 

139 435 440 445 

140 Tyr Thr Ala Asp Glu Asp Cys Thr Tyr Glu Leu Gly Leu Val Val Cys 

141 450 455 460 

142 Gly Thr Ala Lys Ala Tyr Val Asp Asp Gin Leu Val Val Asp Asn Ala 

143 465 470 475 480 

144 Thr Lys Gin Val Pro Gly Asp Ala Phe Phe Gly Ser Ala Thr Arg Glu 

145 J 485 490 495 

146 Glu Thr Gly Arg He Asn Leu Val Lys Gly Asn Thr Tyr Lys Phe Lys 

147 500 505 510 

148 He Glu Phe Gly Ser Ala Pro Thr Tyr Thr Leu Lys Gly Asp Thr He 

149 515 520 525 

150 Val Pro Gly His Gly Ser Leu Arg Val Gly Gly Cys Lys Val He Asp 

151 530 535 540 

152 Asp Gin Ala Glu He Glu Lys Ser Val Ala Leu Ala Lys Glu His Asp 

153 545 550 555 560 
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187 820 825 830 

188 Val 

191 <210> SEQ ID NO: 3 

192 <211> LENGTH: 2502 

193 <212> TYPE: DNA 

194 <213> ORGANISM: Trichoderma reesei 

196 <400> SEQUENCE: 3 . 

197 atggctgata ttgatgttga ggccatcttg aagaagctca ccctggccga gaaggtcgat 60 

198 ctgctggctg gtatcgactt ctggcacaca aaggctctcc ccaagcatgg agtcccctct 120 

199 ctccgcttta cagatggccc caacggcgta agagggacca agttcttcaa tggcgtccct 180 

200 gcggcctgct tcccttgcgg cacgtcgctc ggttccacat tcaaccaaac tctgctcgaa 240 

201 gaggcaggta agatgatggg caaagaggcc atcgctaaga gtgcgcatgt gatcctcggc 300 

202 ccgactatca acatgcaacg ctcccctctc ggtggacgtg gcttcgagtc gattggtgag 360 

203 gatccgttcc tggcgggctt gggagctgcg gctctcatcc gcggcattca gagcactgga 420 

204 gtgcaggcta cgatcaagca ctttttgtgc aatgatcagg aggacaggcg catgatggtg 480 

205 cagagcatcg tcacggagcg ggctctccgt gaaatctacg cactcccgtt ccagattgct 540 
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206 gtgcgagact cccagccggg tgcgttcatg acggcgtaca atggcatcaa tggcgtgtcg 

207 tgcagcgaga accctaaata tcttgatggg atgcttcgaa aggaatgggg ttgggatggc 

208 ctaatcatga gcgactggta cggcacatac agtaccacag aagccgttgt ggcaggcctc 

209 gacctcgaga tgcccggacc tccacgcttc cgaggagaaa cactcaagtt caacgtctcc 

210 aacggaaagc cctttatcca cgtcattgac cagagggcta gggaagttct tcagttcgtc 

211 aagaagtgtg ctgcctccgg agtgacggag aacggccccg agacgactgt caacaacacc 

212 cccgaaacgg cagctctcct ccggaaggtt ggcaacgagg gcatcgtgct gctgaagaac 

213 gagaacaacg ttctgccctt gagcaagaag aagaagacgc tgattgtcgg ccccaacgcc 

214 aagcaggcca cataccacgg cggaggctct gccgcactca gggcctacta cgcagtcact 

215 ccctttgacg gcctcagcaa gcagctcgag acgccgccat cgtacaccgt cggcgcctac 

216 accaccgttc ctcccattct aggcgagcag tgcctcacgc ccgacggcgc tccgggcatg 

217 cgctggaggg tcttcaacga gccccctggt acccctaacc gccagcacat tgacgagctc 

218 ttcttcacca agacggacat gcacctggtg gactactacc accccaaggc ggcagacacg 

219 tggtacgccg acatggaggg cacgtacacc gccgacgagg actgcaccta cgagctcggc 

220 ctcgtcgtct gcggcacggc aaaggcgtac gtagacgacc agctcgtcgt cgacaacgcc 

221 accaagcagg tccccggcga tgccttcttc ggctccgcca cccgcgagga gacgggccgc 

222 atcaatctcg tcaagggcaa cacgtacaag ttcaagatcg agttcggctc cgcacccacc 

223 tacaccctca agggcgacac catcgtcccc ggccacggct ccctccgcgt cggcggctgc 

224 aaggtcattg acgaccaggc cgaaatcgaa aagtccgtcg ccctcgccaa ggagcacgac 

225 caggtcatca tctgcgcggg ccttaacgcc gactgggaga ccgagggcgc cgaccgcgcg 

226 agcatgaagc tccccggcgt gctggaccag ctcattgccg acgtggccgc cgcgaaccca 

227 aacaccgtcg tcgtcatgca gacgggcacc cccgaggaga tgccctggct cgacgccacg 

228 cccgccgtca tccaggcctg gtacggcggc aacgagacgg gcaactccat tgccgacgtc 

229 gtctttggcg actacaaccc ctcgggcaag ctgtccctca gcttccccaa gcgcctgcag 

230 gacaaccccg cgtttctcaa cttccgcacc gaggccgggc gcacgctgta cggcgaggac 

231 gtctacgtcg ggtacaggta ctacgagttt gccgacaagg acgtcaattt cccctttggc 

232 cacggcctgt cctacaccac ttttgccttt tccaatctct ccgtgtctca caaggacggc 

233 aagctgagcg tgtccctctc cgtgaagaac accggctccg tgcccggcgc acaggtggcc 

234 cagctctacg tcaagcccct ccaagcggcc aagattaacc gccccgtcaa ggagctcaag 

235 ggcttcgcaa aggtcgaact gcagcccggc gagacgaagg cggtgacaat cgaggagcag 

236 gagaagtacg tcgctgcgta ttttgatgag gagcgggatc agtggtgtgt cgaaaagggt 

237 gactatgagg ttatcgtgag cgacagcagc gcagcgaagg atggggttgc gctcaggggt 

238 aagtttacgg tgggagagac gtattggtgg tctggcgtgt aa 
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